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ABSTRACT 

Optimization of primer sequences for polymerase 
cliain reaction (PGR) and quantitative PGR (qPGR) 
and reaction conditions remains an experimental 
challenge. We have developed a resource, 
PrimerBank, which contains primers that can be 
used for PGR and qPGR under stringent and 
allele-invariant amplification conditions. A distin- 
guishing feature of PrimerBank is the experimental 
validation of primer pairs covering most known 
mouse genes. Here, we describe a major update of 
PrimerBank that includes the design of new primers 
covering 17 076 and 18086 genes for the human and 
mouse species, respectively. As a result of this 
update, PrimerBank contains 497156 primers (an 
increase of 62% from the previous version) that 
cover 36928 human and mouse genes, correspond- 
ing to around 94% of all known protein-coding gene 
sequences. An updated algorithm based on our 
previous approach was used to design new 
primers using current genomic information available 
from the National Genter for Biotechnology 
Information (NGBI). PrimerBank primers work 
under uniform PGR conditions, and can be used for 
high-throughput or genome-wide qPGR. Because of 
their broader linear dynamic range and greater sen- 
sitivity, qPGR approaches are used to reanalyze 
changes in expression suggested by exploratory 
technologies such as microarrays and RNA-Seq. 
The primers and all experimental validation data 
can be freely accessed from the PrimerBank 
website, http://pga.mgh.harvard.edu/primerbank/. 



INTRODUGTION 

The quantitative polymerase cliain reaction (qPCR) tech- 
nique, also known as real-time PCR, was developed in the 
early 1990s and subsequently improved or modified by a 
large number of groups (1,2). Both probe-(allele-)specific 
as weU as allele-non-specific technologies have been 
described. Real-time PCR is routinely employed for gene 
expression quantification, and has been widely used for 
the confirmation of results from high-throughput experi- 
ments such as DNA microarray expression profile studies 
or transcript abundance measurements by next-generation 
sequencing. Often the role of qPCR is to vahdate expres- 
sion changes of selected genes showing profiles of interest 
in an initial genome-wide survey (3,4). However, with the 
appropriate design strategy, qPCR can also be performed 
in parallel for thousands of genes in a genome-wide 
fashion. Real-time PCR has several major advantages 
over DNA microarrays, the most important being a 
large dynamic range, high sensitivity and high specificity 
(5,6). Historically, a significant obstacle to genome-wide 
PCR had been the inability to identify robust oligonucleo- 
tide primers that could be used under identical conditions. 
Multiple design algorithms have been proposed (7-10), 
but few of them have been validated experimentally. To 
address the primer design challenge, we developed the 
PrimerBank database, which contains tens of thousands 
of computationally designed as weU as experimentally 
validated primers. The primers can be applied either 
to small-scale experiments focusing on a few genes, or 
to high-throughput qPCR for thousands of genes. To 
allow for an efficient high-throughput qPCR process, 
PrimerBank primers have been designed and validated 
to perform at an invariant annealing temperature of 
60°C. The high annealing temperature helps reduce 
non-specific amplification. The PrimerBank resource and 
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its experimentally validated 26 855 primer pairs covering 
most known mouse genes have previously been described 
(11-13). The design of PrimerBank primers was founded 
in turn on an algorithm for the design of oligonucleotide 
probes for microarrays, with additional criteria that are 
specific to PGR (14). All PGR primers were designed to 
work with sequence-independent fluorescence detection 
methods such as SYBR Green-based qPGR (15,16). 
Thus, the design of target sequence-specific fluorescent 
probes has not been required, leading to significant 
savings. 

In this work, we describe an updated primer design 
process for PrimerBank that reflects the maturation of 
our understanding of the murine and human genomes 
and reflects further refinements in the design strategy for 
primer selection. Gurrent genomic annotations from the 
National Genter for Biotechnology Information (NGBI) 
databases were consohdated as a consequence of the 
update. A total of 91 502 and 98 854 new primers for 
humans and mice, respectively, were designed and 
included in the database. Users can access all stored infor- 
mation (primer sequences and annotations, primer 
validation data, as well as gene annotations) from the 
PrimerBank website (http://pga.mgh.harvard.edu/ 
primerbank/). The new primers cover most known 
human and mouse genes and have been selected from 
niilhons of primer candidates using an improved algo- 
rithm and current genomic information. FoUowing this 
update, PrimerBank contains 497 1 56 primers, covering 
94% of all known human and mouse protein-coding 
genes. All previously designed primers remain in the 
database, together with experimental vahdation data. 
Although several resources have been developed that 
contain collections of PGR primers for gene expression 
analysis, only a few thousand of the alternative primer 
pairs have been demonstrated experimentally (17,18). In 
addition, the primers have not been designed to work 
under the same temperature, so that parallel qPGR 
assays cannot be performed simultaneously. PrimerBank 
is designed to avoid these habihties and has a strong pre- 
dictive record reflected by its high resource utilization and 
high frequency of repeat users. PrimerBank is a 
BioDBcore comphant database (Supplementary 
Information). 

UPDATE ON PRIMER DESIGN 

Gene sequence selection 

The workflow for the update process is presented in 
Figure 1 . In our previous design, protein-coding sequences 
were extracted from GenBank and redundant sequences 
were removed by an automated bioinformatics pipehne 
prior to primer selection (14). The removal of sequence 
redundancy was necessary because each human or 
mouse gene was (and still is) typically represented by 
many sequences in GenBank. In the updated version, 
sequence redundancy has been removed by relying on 
the RefSeq database. In recent years, the NGBI RefSeq 
project has advanced to the point that RefSeq entries span 
nearly all the widely accepted human and mouse candidate 



genes (19). One major goal of the RefSeq database has 
been to manually curate known gene sequences by 
organizing them according to genomic locus. From the 
transcripts found at each locus, representative sequences 
were selected to uniquely represent the genes. In this way, 
sequence redundancy has been greatly reduced and confi- 
dence in the sequence assignment has been greatly 
improved. The RefSeq database is now widely accessed 
to identify known genes, especially those derived from 
human and murine sources. 

We have adopted RefSeq sequences as our template for 
the design of gene-specific PGR primers. The rehance on 
RefSeq only affects the internal primer design process, and 
has no effect on the user search interface at the 
PrimerBank website. In humans and mice, alternative 
splicing is a common phenomenon, leading to multiple 
distinct transcript isoforms for some genes. In the new 
design strategy, each primer pair was designed to cover 
aU known isoforms from a given gene. As the first step, 
the shortest transcript from aU isoforms for a gene was 
selected for primer design. The sequence of the selected 
transcript was scanned from the 5'- to 3'-end and 
sequence regions that were not present in all isoforms 
were excluded from further consideration. In this way, 
each primer pair was designed to quantify the expression 
of a gene by profihng aU known isoforms. 

Primer design process 

With the template sequences available, the next step was 
to apply a stringent bioinformatic algorithm for primer 
selection. This new algorithm is similar to our previous 
algorithm (11), with some modifications. The sequence 
template selection strategy is guided by RefSeq as 
described above. In addition, the workflow of the design 
algorithm has been significantly improved, leading to 
faster performance. In the updated algorithm, the PGR 
amplicon size has been set to default to a narrower 
range than in the initial algorithm (100-250 versus 
100-350 bp initially). A relatively short amplicon size is 
important for PGR amplification efficiency, especially if 
the RNA quahty is low (e.g. extracted from archived 
chnical samples). On the other hand, if the amplicon 
size is weh below 100 bp, it would be a challenge to dif- 
ferentiate specific PGR products from potential 
non-specific primer dimers when using gel electrophoresis 
or PGR melting curves for the specificity check. Most 
selected primer pairs (99.5%) were designed to generate 
amplicons in the default size range. In cases in which the 
size requirement could not be satisfied, alternative 
ranges were adopted for primer selection (60-99 and 
251-550 bp). 

One important design consideration is the homogeneity 
of the PGR assays. All the primers have 7"^ values in the 
range of 60-63° G as calculated by the nearest neighbor 
basepair stability method (20). The narrow range of 7"^ 
is helpful for homogeneous primer annealing, especially if 
multiple assays are to be performed in a single plate. In 
addition, the primer GG content is restricted to 35-65% to 
ensure uniform primer annealing. The design algorithm 
also avoids sequence regions with a high potential for 



D1146 Nucleic Acids Research, 2012, Vol. 40, Database issue 



Bioinformatics algorithm 
for primer design 



Updated Bioinformatics 
algoritiim for primer design 



^ /Old genomic // I 
I annotations / / prir^ers / 



I Updated 7 
> I genomic / 
/ annotations / 



L 



Newly designed 
primers 



7 



Rejected primer 
candidates 



PrimerBank 



Experimental validation 



Updated Web interface 
for primer search 



Failed Failed Failed Failed 
amplification gel sequencing BLAST 



Figure 1. Workflow for the updating process of PrimerBanlc. 



secondary structure formation as site inaccessibility has 
been reported to lead to insufficient primer annealing (11). 

The success of qPCR is dependent on the assay specifi- 
city, since non-specific PCR amplification results in in- 
accurate template quantitation. In the design presented 
here, multiple bioinformatics filters were implemented to 
select primers with high specificity. To avoid non-specific 
anneahng resulting from low template sequence complex- 
ity, the DUST program was used and sequences of low 
complexity were identified and rejected (21). The 3'-end 
nucleotides of a primer contribute significantly to 
non-specific extension of primers, especially if the anneal- 
ing of these nucleotides is favored (22). The design algo- 
rithm evaluates the free energy (AG) for the anneahng of 
the last five bases in a primer, and primer candidates with 
stable 3'-ends (less than — 9kcal/mol) were rejected. For 
SYBR Green-based qPCR (or other similar quantification 
methods), one particular challenge is the formation of 
primer diniers. Previous studies indicate that DNA poly- 
merase extension can be greatly reduced by a single 
mismatch in the 3'-end of a primer (23,24). To pass the 
3'-end criterion, the last four bases in a primer are not 
allowed to anneal perfectly to itself or to any of the can- 
didate primers in a proposed pair. One important specifi- 
city filter is the rejection of primer candidates with a 
stretch of contiguous bases matched perfectly to other un- 
intended transcript templates. The screening for contigu- 
ous base match was done using an efficient computational 
algorithm described previously (14). To further search for 
potential primer cross-reactivity, BLAST search against 
all known sequences in the transcriptonie was also per- 
formed to reject non-specific primers. 




Figure 2. Distribution of all rejected human primer candidates by 
various bioinforinatics screening filters. Combined together, 99% of 
all priiner candidates were rejected by these screening filters. 



Primer design statistics 

During the primer selection process, 99% of primer can- 
didates were rejected for faihng at least one screening 
filter. The distribution of rejected primer candidates for 
human genes is presented in Figure 2. The design statistics 
for mouse primer selection are similar to those for human 
primers. The specificity filters collectively rejected 39% of 
aU primer candidates. About 31% of the primer candi- 
dates were then rejected because they were out of the 
required range. Another major category of screening 
failures was generated by secondary structure (sequence 
complementarity) filters, which collectively rejected 24% 
of the primer candidates. In addition, other screening 
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PrimerBank 

The following primer pair is found for 83641890b1 

Gene Descriptions: _ j 

NCBI GenelD 2597 

GenBank Accession NM 002046 
NCBI Protein Accession NP 002037 

Species Human 

Coding DNA Lengtii 1 008 

Gene Description Homo sapiens glyceraldehycle-3-phosphate dehydrogenase (GAPDH), mPNA. 



Primer Pair Descriptions: 
PrimerBanIi ID 83641890b1 
AmpliconSize 102 

Sequence (5'-> 3") 
Fonftfard Primer aagstgaasgtcsgagtcaac 
Reverse Primer ggggtcattgatggcaacaata 



Length Tm Location 

21 61.7 7-27 



22 



60.6 108-87 



Location in Coding Sequence (primers and amplicon highlighted) 

1 atgggga^^g tg^aggtcgg agtcaacqq^ tttggtcgta ttgggcgcct ggtcaccagg 
61 gctgctttta actctggtaa agtggatfifcfc gttgccatcA AtgAcccctZ cattgacctc 
121 aactacatgg tttacatgtt ccaatatgat tccacccatg gcaaattcca tggcaccgtc 
181 aaggctgaga acgggaagct tgtcatcaat ggaaatccca tcaccatctt ccaggagcga 

?41 rfj^r rr'f^M-r'R ss«r r-sftrft- rf rf rf rf (- rf r rf n r rfrfrrfcr rfsrff s r- rf r n rf r rtrrs rtT r'cf^r'T rfrrc 

Figure 3. A screenshot to demonstrate the PrimerBank search resuh. GAPDH is used here as an example. 



Table 1. Detailed statistics of newly designed human and mouse 
primers 



Number of new 
primer pairs 


Number of 
human genes 


Number of 
mouse genes 


Number of genes 
from both species 


1 


1854 


1647 


3501 


2 


1769 


1537 


3306 


3 


13453 


14902 


28 355 


Total 


17 076 


18 086 


35 162 



filters, such as the requirement to cover ah gene isoforms, 
resuhed in the ehiTiination of a significant number of 
primer candidates. 

Up to three new primer pairs were designed for each 
human and mouse gene. These new primers are designated 
as the version 'b' primers, and assigned PrimerBank IDs 
consisting of the nuiTierical transcript ID followed by the 
suffix bl, b2 or b3 (see Figure 3 for an example). Among 
all the genes covered by the new 'b' primers, the vast 
majority (81%) have three primer pairs (Table 1); the 
rest have one or two 'b' primer pairs. In total, 190 356 



new priiTiers (including all 'bl', 'b2' and 'b3' priiTiers) 
were designed and then imported into the PriiTierBank 
database. Previously designed primers (version 'a' 
primers), from the previous version of the database, 
remain in the updated database with the PrimerBank 
IDs remain unchanged (each consisting of a numerical 
transcript ID followed by the suffix al, a2 or a3). The 
statistics for all PrimerBank primers, including both 
newly designed ones and previous ones, are summarized 
in Table 2. An analysis of current genomic annotations 
with respect to the primer pairs that had been experimen- 
tally vahdated previously is presented in Table 3. 



PRIMERBANK WEBSITE 

The allowed database search criteria remain the same as 
previously, and include the foUowing terms: GenBank ac- 
cession number, NCBI protein accession number, NCBI 
gene ID, PriinerBank ID, NCBI gene symbol and gene 
description (keyword). In addition to the primer pair se- 
quences and corresponding information, the cDNA and 
amplicon sequences have also been added. These can be 
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Table 2. Summary of primer statistics for the updated PrimerBank database 





Human 


Mouse 


Both species 


Number of newly designed primer pairs 


45 751 


49427 


95 178 


Number of previous primer pairs 


83 941 


69459 


1 53 400 


Total number of primer pairs designed 


1 29 692 


1 18 886 


2 48 578 


Number of currently annotated protein-coding genes 


18 932 


20 305 


39 237 


Number of genes represented by new primers 


17 076 


18 086 


35 162 


Number of genes represented by previous primers 


16383 


17038 


33 421 


Number of genes represented by all primers 


17 973 


18955 


36 928 



Table 3. Experimentally validated mouse primers in the context of current genomic annotations 



Number of primer pairs 
representing current genes 



All tested primer pairs 23 465 

Validated primer pairs based on all criteria 15 473 (66%) 

Validated Primer pairs based on gel electrophoresis 19 561 (83%) 

Validated primer pairs based on sequencing and BLAST 17 242 (73%) 

Primer pairs failed qPCR amplification 1331 (5.7%) 



viewed by clicking on the 'cDNA and amplicon sequence' 
link. A screenshot of a search result is presented in 
Figure 3. 

All the newly designed PCR primers have been 
imported into PrimerBank and are accessible via the 
web search interface. In addition, previously designed 
version 'a' primers continue to be stored in the same 
database. Thousands of these previous primers have 
been experimentally vahdated (12). The vahdation data 
are an important consideration for users deciding which 
primer pairs to use. In addition, an unknown number of 
previous primers have been used and referenced in what 
we estimate are hundreds of peer-reviewed publications. 
Thus the previous primers need to be retained to preserve 
hnks that may have been made by prior research efforts 
presented in the hterature. Given that multiple versions of 
primers are stored in PrimerBank, we have developed the 
following strategy to prioritize the primer pairs for web 
presentation: (i) All newly designed 'b' primers for the 
selected genes (up to three primer pairs per gene) are pre- 
sented; (ii) all experimentally validated version 'a' primers 
for the selected genes, along with the validation data, are 
presented; and (iii) in most cases, other non-vahdated 
version 'a' primers are not presented. However, an excep- 
tion is made when only version 'a' primers are available to 
represent the gene. In addition, another exception is made 
when searching by PrimerBank ID whereby all version 'a' 
or 'b' primers can readily be retrieved. In this way, re- 
searchers retain the abihty to obtain the information for 
any primer from any design version when desired. 



DISCUSSION 

Technological advances in transcriptome analysis via tran- 
scriptional microarrays or next-generation sequencing has 
led to a significantly greater demand for experimental val- 
idation of gene expression levels. To this end, real-time 
PCR (qPCR) is widely considered the gold standard for 



validation of high-throughput expression data. One major 
challenge for qPCR is the design of robust PCR primers 
with high efficiency, specificity and homogeneity when 
multiple assays are simultaneously performed. To meet 
these requirements, we have developed a robust bioinfor- 
matics process for primer design. The algorithm has been 
used to design tens of thousands of PCR primers to cover 
most known human and mouse genes, aU of which are 
freely accessible via the PrimerBank website. 

High-throughput qPCR using PrimerBank primers has 
several advantages. The primers work under the same con- 
ditions and thousands have been experimentally vahdated. 
The expression profiles of thousands of genes can be 
determined at the same time, making the primers useful 
for high-throughput nanohter-scale qPCR platforms, such 
as OpenArray from Life Technologies and BioMark from 
Fluidigm, in which thousands of qPCR assays can be per- 
formed in parallel (25,26). 

PrimerBank is currently the largest public database for 
the retrieval of PCR and qPCR primers. In addition, 
PrimerBank contains thousands of experimentally 
validated primers, comprising the largest collection of its 
kind in the public domain. 



SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Onhne: 
Supplementary File SI: List of the database core attri- 
butes in accordance with the BioDBcore standards. 



FUNDING 

National Institutes of Heahh (UOl HL066678 and ROl 
GM089784). Funding for open access charge: Center for 
Computational and Integrative Biology, Massachusetts 
General Hospital. 

Conflict of interest statement. None declared. 



Nucleic Acids Research, 2012, Vol. 40, Database issue D1149 



REFERENCES 

1. Bustiii,S.A. (2000) Absolute quantification of mRNA using 
real-time reverse transcription polymerase chain reaction assays. 
/. Mol. Endocrinol, 25, 169-193. 

2. Walker,N.J. (2002) Tech. Sight. A technique whose time has come. 
Science, 296, 557-559. 

3. Canales,R.D., Luo,Y., Willey,J.C., Austermiller,B., 
Barbacioru.C.C, Boysen,C., Hunkapiller,K., Jensen,R.V., 
Knight,C.R., Lee,K.Y. et al. (2006) Evaluation of DNA 
microarray results with quantitative gene expression platforms. 
Nat. Biotechnol., 24, 1115-1122. 

4. Arikawa,E., Sun,Y., Wang,J., Zhou,Q., Ning,B., DialS.L., 
Guo,L. and Yang,J. (2008) Cross-platform comparison of 
SYBR Green real-time PGR with TaqMan PGR, microarrays 
and other gene expression measurement technologies evaluated 
in the MicroArray Quality Gontrol (MAQG) study. BMC 
Genomics, 9, 328. 

5. VanGuilder.H.D., Vrana.K.E. and Freeman,W.M. (2008) 
Twenty-five years of quantitative PGR for gene expression 
analysis. Bioteclmiqiies, 44, 619-626. 

6. Wong,M.L. and Medrano,J.F. (2005) Real-time PGR for mRNA 
quantitation. Biotechnique.i, 39, 75-85. 

7. Marshall,O.J. (2004) PerlPrimer: cross-platform, graphical primer 
design for standard, bisulphite and real-time PGR. Bioinformatic.s, 
20, 2471-2472. 

8. Podowski,R.M. and Sonnhammer,E.L. (2001) MEDUSA: large 
scale automatic selection and visual assessment of PGR primer 
pairs. Bioinformatics, 17, 656-657. 

9. Kim,N. and Lee,G. (2007) QPRIMER: a quick web-based 
application for designing conserved PGR primers from 
multigenome alignments. Bioinformcnics, 23, 2331-2333. 

10. You,F.M., Huo,N., Gu,Y.Q., Luo,M.G., Ma,Y., Hane,D., 
Lazo,G.R., Dvorak,J. and Anderson, O.D. (2008) BatcliPrimer3: a 
high throughput web application for PGR and sequencing primer 
design. BMC Bioinformatics, 9, 253. 

11. Wang,X. and Seed,B. (2003) A PGR primer bank for quantitative 
gene expression analysis. Nucleic Acids Res., 31, el54. 

12. Spandidos,A., Wang,X., Wang,H., Dragnev,S., Thurber,T. and 
Seed,B. (2008) A comprehensive collection of experimentally 
validated primers for Polymerase Ghain Reaction quantitation of 
murine transcript abundance. BMC Genomics, 9, 633. 

13. Spandidos,A., Wang,X., Wang,H. and Seed.B. (2010) 
PrimerBank: a resource of human and mouse PGR primer pairs 
for gene expression detection and quantification. Nucleic Acids 
Res., 38, D792-D799. 



14. Wang,X. and Seed.B. (2003) Selection of oligonucleotide probes 
for protein coding sequences. Bioinformatics, 19, 796-802. 

15. Morrison.T.B., Weis,J.J. and Wittwer,G.T. (1998) Quantification 
of low-copy transcripts by continuous SYBR Green I monitoring 
during amplification. Bioteclmicjiies, 24, 954-958, 960, 962. 

16. Wittwer,G.T., Herrmann,M.G., Moss,A.A. and Rasmussen,R.P. 
(1997) Gontinuous fluorescence monitoring of rapid cycle DNA 
amplification. Bioteclmicjiies, 22, 130-131, 134-138. 

17. Lefever,S., Vandesompele,J., Speleman,F. and Pattyn,F. (2009) 
RTPrimerDB: the portal for real-time PGR primers and probes. 
Nucleic Acids Res., 37, D942-D945. 

18. Gui,W., Taub,D.D. and Gardner,K. (2007) qPrimerDepot: a 
primer database for quantitative real time PGR. Nucleic Acids 
Res., 35, D805-D809. 

19. Maglott,D., Ostell,J., Pruitt,K.D. and Tatusova,T. (2007) Entrez 
Gene: gene-centered information at NGBI. Nucleic Acids Res., 35, 
D26-D31. 

20. SantaLucia,J. Jr (1998) A unified view of polymer, dumbbell, and 
oligonucleotide DNA nearest-neighbor thermodynamics. Proc. 
Natl Acad. Sci. USA, 95, 1460-1465. 

21. Hancock,J.M. and Armstrong.J.S. (1994) SIMPLE34: an 
improved and enhanced implementation for VAX and Sun 
computers of the SIMPLE algorithm for analysis of clustered 
repetitive motifs in nucleotide sequences. Comput. Appl. Biosci., 
10, 67-70. 

22. Rychlik.W. (1995) Priming efficiency in PGR. Biotechniques, 18, 
84-86, 88-90. 

23. Kwok,S., Kellogg,D.E., McKinney,N., Spasic,D., Goda,L., 
Levenson,G. and Sninsky,J.J. (1990) Effects of primer-template 
mismatches on the polymerase chain reaction: human 
immunodeficiency virus type 1 model studies. Nucleic Acids Res., 
18, 999-1005. 

24. Huang,M.M., Arnheim,N. and Goodman, M.F. (1992) Extension 
of base mispairs by Taq DNA polyinerase: implications for single 
nucleotide discrimination in PGR. Nucleic Acids Res., 20, 
4567-4573. 

25. Morrison,T., Hurley,J., Garcia,J., Yoder,K., Katz,A., Roberts, D., 
Gho,J., Kanigan,T., Ilyin.S.E., Horowitz,D. et al. (2006) 
Nanoliter high throughput quantitative PGR. Nucleic Acids Res., 
34, el23. 

26. Spurgeon,S.L., Jones, R.G. and Ramakrishnan,R. (2008) High 
throughput gene expression measurement with real time PGR in a 
microfluidic dynamic array. PLoS One, 3, el662. 



